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1  Work  Performed  within  This  Reporting  Period 

In  this  reporting  period,  we  performed  the  following  tasks. 

•  Developed  YouTube  Analytics  Capabilities.  We  have  developed  computation 
of  YouTube  basic  statistics  including  Top  Videos,  Top  Users,  Top  Words,  and 
Top  Languages,  and  also  applied  NER  to  the  text  associated  with  YouTube  posts. 
We  have  also  developed  UI  for  interactive  display,  and  designed  map  and  media 
gallery  views. 

•  Released  Scraawl  1.16. 

1.1  Development  of  YouTube  Analytics  and  UI 

1.1.1  YouTube  Top  K  Statistics  Computation  and  View 

We  have  developed  a  fast  and  reliable  computation  of  Top  K  statistics  for  YouTube 
videos.  In  particular,  we  compute  Top  Videos,  Top  Users,  Top  Words,  and  Top 
Languages.  In  all  cases,  “Top”  refers  to  higher  count.  Similar  to  other  data  sources,  we 
have  also  implemented  a  UI  to  show  and  interact  with  these  statistics.  Figure  1  shows  a 
representative  view  of  the  UI.  Top  Videos,  Top  Users,  and  Top  Words  are  shown  as  lists 
while  Top  Languages  are  shown  as  a  pie  chart.  Every  Top  K  Statistics  also  include  a 
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drill-down  menu,  where  users  can  access  using  the  “Details”  button.  In  the  drill-down 
menu,  users  can  see  Top  50  statistics  and  other  relevant  metrics.  Also,  users  can  filter  to 
include  or  exclude  the  relevant  posts. 


Basic  Statistics 

Sources:  ■  Folder:  Search  Reports  ■  Report  name:  Pofcemafi  GO  ■  Search  terms:  Pofcemon  GO 

liM  Basic  Statistics 


Status:  Completed  -  Posts:  100  ■  Updated:  20  Juif  2016  10:20  Timeline:  14  Jut,  2016  19:46  -  19Juf,  2016  18:00  ft  Ac:  :rs  - 


FA5TE5TWAY  TO  CATCH  PQ.. 
POKEMON  GO  IS  QANGEROLL. 
Pokemon  GO  -  HOWTO  CATC.., 
HOWTO  CATCH  MEWTWO!?... 

I  HATE  POKEMON  GO!!! 


iOO  taisi  videos 


Top  Languages 


H  French, 

□  Spanish 

□  PurtuguESE 
Ini  German 


9  total  languages 


Details  > 


Figure  1:  Representative  Top  K  Statistics  for  YouTube  Videos. 


1.1.2  YouTube  Timeline  View 

Similar  to  the  other  data  feeds,  we  have  developed  a  timeline  view  that  shows  the  number 
of  posts.  The  timeline  view  is  interactive  and  the  selected  portion  can  be  filtered  using 
“Filter  By  Range”  as  shown  in  Figure  2. 


Timeline 


1.1.3  YouTube  Media  Gallery  View 

We  show  the  snapshots  of  the  Top  50  videos  in  a  grid  under  “Media  Gallery”  view.  The 
video  snapshots  are  clickable  and  each  click  directs  the  user  to  a  page  that  has  statistics 
about  video.  A  representative  media  gallery  view  is  shown  in  Figure  3. 
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Media 


Total  media:  50 


Page:  1/2  Next> 


Page:  1/2 


Next  > 


Last » 


Figure  3:  Representative  Media  Gallery  View. 


1.1.4  YouTube  Named  Entity  Recognition  (NER)  View 

We  perform  Scraawl’s  NER  algorithm  to  the  text  associated  with  YouTube  post,  which 
classifies  the  named  entities  into  organizations,  locations,  and  persons.  Scraawl’s  NER 
also  performs  abbreviation  extensions,  e.g.,  UN  is  mapped  to  “United  Nations.”  A 
representative  NER  view  along  with  other  statistics  is  shown  in  Figure  4. 


Status:  Completed  Posts:  100  Updated:  20  Jul,  2016  10:20 


Timeline:  14  Jul,  201 6  19:46-  19  Jul,  2016  18:00  O  Actions- 


Top  Named  Entities 


Page:  1/8 

Next  > 

Last » 

Show  j  All  | 

|Search  entity 

□ 

|  T  Filter  | 

Entity 

i  United  Nations 
Society  Of  Indexers 

Massachusetts  Institute  Of  Technology 

United  States 

LeJeu 


Type 

55  Organization 
55  Organization 
55  Organization 
9  Location 
A  Person 


Entity  count 

20 

6 


»  %  of  entities 

50.0% 

15.0% 

10.0% 

17.4% 

4.2% 


%  of  posts 

10.0% 

5.0% 

3.0% 

4.0% 

3.0% 


i  %  of  users 

14.4% 

5.6% 

3.3% 

4.4% 

3.3% 


Figure  4:  Representative  NER  View. 


1.1.5  YouTube  Map  View 

We  use  Scraawl’s  Location  Map  and  Heat  Map  view  to  display  geolocation  information 
about  YouTube  videos.  In  particular,  we  show  geo-coded  and  geo-referenced  YouTube 
posts.  Geo-coded  posts  are  those  which  have  GPS  information  on  them  while  geo- 
referenced  posts  are  those  which  mention  location  information.  A  representative  Map 
view  is  shown  in  Figure  5. 
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Figure  5:  Representative  Map  View. 

2  Current  Problems 

None. 

3  Work  to  be  Performed  in  the  Next  Reporting  Period 

In  the  next  report  period,  we  will  focus  on  the  following  tasks: 

•  We  will  develop  an  API  for  news  feed  collection. 

•  We  will  deliver  Scraawl  1.17. 
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